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Abstract: Bryum argenteum Hedw. is a desiccation tolerant bryophyte and belongs to one of the most 
important components of the biological soil crusts (BSCs) found in the deserts of Central Asia. Limited 
information is available on rehydration-responsive proteins in desiccation tolerant plants. As a 
complement to our previous research analyzing the rehydration transcriptome, we present a parallel 
quantitative proteomic effort to study rehydration-responsive proteins. Bryophyte gametophores were 
desiccated (Dry) and rehydrated for 2 h (R2) and 24 h (R24). Proteins from Dry, R2 and R24 
gametophores were labeled by isobaric tags for relative and absolute quantitation ((TRAQ) to determine 
the relative abundance of rehydration-responsive proteins. A total of 5503 non-redundant protein 
sequences were identified and 4772 (86.7%) protein sequences were annotated using Gene Ontology (GO) 
terms and Pfam classifications. Upon rehydration 239 proteins were elevated and 461 proteins were 
reduced as compared to the desiccated protein sample. Differentially up-regulated proteins were classified 
into a number of categories including reactive oxygen species scavenging enzymes, detoxifying enzymes, 
Late Embryogenesis Abundant (LEA) proteins, heat shock proteins, proteasome components and 
proteases, and photosynthesis and translation related proteins. Furthermore, the results of the correlation 
between transcriptome and proteome revealed the discordant changes in the expression between protein 
and mRNA. 
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1 Introduction 


Water is a scarce resource in drylands that cover about 41% of the Earth surface (Reynolds et al., 
2007; Maestre et al., 2012). In order to improve crop yields in drylands, researchers have made 
great efforts towards the study of plants adaptation under increased water deficit conditions. For 


“Corresponding author: ZHANG Daoyuan (E-mail: zhangdy @ ms.xjb.ac.cn) 

Received 2016-08-15; revised 2017-11-13; accepted 2017-11-20 

© Xinjiang Institute of Ecology and Geography, Chinese Academy of Sciences, Science Press and Springer-Verlag GmbH Germany, 
part of Springer Nature 2018 


http://jal.xjegi.com; www.springer.com/40333 


GAO Bei et al.: Desiccation tolerance in bryophytes: the rehydration proteomes of Bryum argenteum... 


example, in tracheophytes, many constitutive traits (e.g., deeper roots, reduced leaf area, 
increased root to shoot ratio, etc.) contribute to water deficit tolerance (Xiong and Zhu, 2002). But 
in the respect of desiccation tolerance (DT), which refers to the ability of an organism to endure 
severe water loss (i.e., water contents below 10% (w/w)), is lethal to most vascular plants, and is 
very rare in angiosperms (Oliver et al., 2000a; Gaff and Oliver, 2013). In the desiccated state, 
tolerant organisms appear completely dry due to the free water in the organism is almost totally 
removed, but rapidly regain normal physiological characteristics after rehydration (Bewley, 1979). 
Among vascular plants, DT of vegetative tissues is restricted to resurrection plants, although 
dormant seeds and pollen grains can, with some exceptions, survive during the desiccation (Gaff 
and Oliver, 2013). However, numerous bryophyte species possess desiccation-tolerant vegetative 
tissues (Oliver et al., 2000a; Wood, 2007) although the phenomenon has been comprehensively 
studied in a limited number of moss species. 

Several omics-level efforts have been made to elucidate the mechanisms of rehydration in DT 
moss (Wood et al., 1999; Oliver et al., 2004, 2009). For example, Bewley (1979) and Oliver (1991) 
found that masked transcripts are preferentially selected and translated through activation of a 
repair-based mechanism. Recently, we demonstrated that the important role of transcript stability 
and translational gene control as the key components in DT moss by using the high throughput 
RNA-Seq technique (Gao et al., 2015). Meanwhile, a set of dehydration/rehydration proteomes in 
the Physcomitrella patens and Fontinalis antipyretica has been reported (Wang et al., 2009; Cui et 
al., 2012; De Carvalho et al., 2014). However, both of them are not recognized as the DT moss. 
Therefore, resuscitation mechanism of the DT moss in the proteome-level is not clearly studied. 

Bryum argenteum Hedw. is a cosmopolitan species of the Bryaceae and belongs to one of the 
most important components of the biological soil crusts (BSCs) found in the Northwest China (Li 
et al., 2004; Zhang et al., 2007). According to the classification of vegetative DT levels proposed 
by Wood (2007), B. argenteum was classified as category “A” (i.e., tolerating 30% relative 
humidity or below). In the past few years, we studied the transcriptomics of B. argenteum by 
high-throughput RNA-Seq to elucidate the molecular mechanisms of resuscitation, especially the 
rapid recovery of photosynthesis upon rehydration (Gao et al., 2015). However, the rehydration 
proteomes of B. argenteum were not concerned. Consequently, we investigated the relative 
abundance of rehydration-responsive proteins in the extremely DT moss during the process of 
dehydration-rehydration by using the iTRAQ (isobaric tags for relative and absolute quantitation) 
proteome relative quantification technology. 


2 Materials and methods 


2.1 Plant material 


Gametophores of Bryum argenteum Hedw. were collected from the Gurbantunggut Desert of 
Xinjiang Uygur Autonomous Region of China (44°32'30’N, 88°06'42”E) and stored as previously 
described by Gao et al. (2015). Voucher specimens are maintained in the Key Lab of 
Biogeography and Bioresearch, Chinese Academy of Sciences and Department of Plant Biology, 
Southern Illinois University (Carbondale, IL, USA). B. argenteum gametophyte tissue was 
cultured on solid Knop media (0.25 g/L KH2PO.a, KCI, MgSOa-7H20, 1 g/L Ca(NO3)2-4H20 and 
0.0125 g/L FeSO4-7H20, pH adjusted to 5.8) in 9-cm Petri dishes. Gametophores of B. argenteum 
(approximately 60 clumps per Petri dish) were cultured at 25°C with a 16-h photoperiod (under 
cool white fluorescent light, ~4000 lux). Desiccated moss tissues were obtained by placing 
gametophores on dry filter paper in an open Petri dish exposed to 20% relative humidity for 24 h. 
This drying regime results in a >95% decrease in the fresh weight of moss tissue (Gao et al., 2015). 
Desiccated gametophores (denoted as Dry) were rehydrated for either 2 h (denoted as R2) or 24 h 
(denoted as R24) with deionized water at 18°C in the light. Each biological replicate included 6 
clumps of moss from an individual Petri dish and 6 biological replicates were pooled for each 
treatment (i.e., Dry, R2, and R24). Plant tissues were immediately frozen in liquid nitrogen and 
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stored at —80°C for subsequent analysis. 


2.2 Protein preparation, digestion, iTRAQ labeling, strong cation exchange and liquid 
chromatography-tandem mass spectrometry (LC-MS/MS) proteomic analysis 


For each sample, gametophores collected from 6 Petri dishes (i.e., 36 clumps, 6 clumps from each 
plate) were pooled. The iTRAQ proteome experiment and mass spectrum analyses were 
performed by Beijing Genome Institute, China and the detailed i1TRAQ procedures were 
referenced from Peng et al. (2015) and Zhang et al. (2015). Firstly, tissues were ground into fine 
powder in liquid nitrogen, and suspended in the lysis buffer (7 M Urea, 4% CHAPS, 2 M 
Thiourea, 40 mM Tris-HCl, pH 8.5, 1 mM PMSF, and 2 mM EDTA). The proteins were reduced 
with DTT (10 mM, final concentration) at 56°C for 1 h and then alkylated by 55 mM IAM (final 
concentration) in the darkroom for 1 h. The reduced and alkylated protein mixtures were 
precipitated by adding 4 volumes of chilled acetone at —20°C overnight. After centrifuging at 
30,000 g at 4°C, the pellet was dissolved in 0.5 M TEAB (Applied Biosystems, Milan, Italy) and 
sonicated on ice. Then after centrifuging at 30,000 g at 4°C, the supernatant was analyzed for 
protein concentration using the Protein Assay Kit (Bio-Rad, Hercules, CA, USA) referenced the 
Bradford method using a BSA standard. The quality and purity of each protein sample was 
evaluated by SDS-PAGE electrophoresis (Ross et al., 2004; Peng et al., 2015). 

Subsequently, equal amounts (100 ug) of proteins of each sample were digested with trypsin 
enzyme (Promega, Madison, WI, USA; added at 0 and 4 h) for 12 h at 37°C to get tryptic peptides. 
The iTRAQ labeling of peptide samples derived from desiccated (Dry) and rehydrated (R2 and 
R24) samples were performed using an iTRAQ reagent Multiplex Kit (Applied Biosystems, 
Foster City, CA, USA) according to the standard manufacturer’s protocol. The tryptic peptides 
were then labeled by three iTRAQ tags (114 for Dry, 116 for R2, and 119 for R24, respectively) 
for 2 h at room temperature. Then, the samples were mixed and dried after labeling. The Dry, R2, 
and R24 peptides were combined in equal parts and further fractionated using the strong 
cation-exchange (SCX) chromatography (LC-20AB, Shimadzu, Japan). The labeled peptides were 
resuspended in 4 mL buffer A (25 mM NaĦH2PO; in 25% acetonitrile, pH 2.7). The following 
SCX chromatography was performed on a HPLC system (Shimadzu, Japan) using a 250.0 
mmx4.6 mm (5-um particle size) Ultremex SCX column (Shimadzu, Kyoto, Japan) at a flow rate 
of 1.0 mL/min. A 7-min elution was conducted by buffer B (a combination of 25 mM NaH2PO, 
and 1 M KCl in 25% acetonitrile, pH 2.7), and followed by 20-min linear gradient with 5%—60% 
buffer B, then ramping up to 100% solvent B for 1 min and re-equilibrated in 5% solvent B for 10 
min. The absorbance at 214 nm was monitored and a total of 20 fractions were collected, 
lyophilized, and desalted using the Strata X column (Shimadzu, Kyoto, Japan). Eluents were 
lyophilized and stored at —-80°C prior to LC-ESI-MS/MS analysis. 

Each lyophilized SCX fraction was redissolved in solvent A (5% acetonitrile, 0.1% FA). 
Liquid chromatography was performed using a LC-20AD nano-HPLC (Shimadzu, Japan) and 
loaded by an autosampler onto a 2-cm C18 capillary trap cartridge. The samples were loaded at 8 
uL/min for 4 min and then separated on a 15-cm analytical C18 column (75 um inner diameter, 
LC Packings). The 35-min linear gradient starting from 2% to 35% B (95% ACN, 0.1% FA) was 
run at 300 uL/min and followed by 5 min linear gradient to 60%, then a 2-min linear gradient to 
80%, and maintained at 80% B for 2 min, and finally returned to 5% in 1 min and maintained for 
10 min. 

Data acquisition was performed with a TripleTOF 5600 System (AB SCIEX, Framingham, MA) 
and a pulled quartz tip as the emitter (New Objectives, Woburn, MA). Data was acquired using an 
ion spray voltage of 2.5 kV, curtain gas of 30 psi, nebulizer gas of 15 psi, and an interface heater 
temperature of 150°C. The MS was performed with an RP of greater than or equal to 30,000 full 
width at half maximum (FWHM) for Time of Flight (TOF) MS scans. For information dependent 
acquisition (IDA), survey scans were acquired in 250 ms and as many as 30 product ion scans 
were collected if exceeding a threshold of 120 counts per second and with a 2+ to 5+ charge-state. 
Total cycle time was fixed to 3.3 s. Q2 transmission window, where ions were transmitted from 
Q1 into the TOF region, was 100 Da for 100%. Four-time bins were summed for each scan at a 
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pulser frequency value of 11 kHz through monitoring the 40 GHz multichannel time to digital 
convertor (TDC) detector with four-anode channel detect ion. A sweeping collision energy setting 
of 3545 eV coupled with iTRAQ rolling collision energy was applied to all precursor ions for 
collision-induced dissociation. Dynamic exclusion was set for 1/2 of peak width (15 s), and then 
the precursor was refreshed off the exclusion list. 


2.3 Bioinformatic analyses 


Raw MS files obtained from the Triple TOF 5600 System were transformed using Proteome 
Discoverer v1.2 software (Thermo Fisher Scientific, CA, USA) and the transformed file (mfg files) 
was subsequently loaded into Mascot v2.3.02 software ( Matrix Science, London, UK) to search 
against the protein sequences deduced from the de novo B. argenteum transcriptome sequence 
assembly. The proteome data have been deposited in the MassIVE database, a member database 
of the ProteomeXchange consortium, with accession number of MSV000079840 
(ftp://massive.ucsd.edu/MS V000079840). The B. argenteum transcriptome sequences can be 
accessed from the NCBI SRA database (http://www.ncbi.nlm.nih.gov/bioproject/PRJNA272646/). 
And the parameters used in Mascot for protein identification were referenced from Peng et al. 
(2015). Specifically, the identification parameters included: (1) mass tolerance of +0.05 Da was 
permitted for peptide masses, and (2) +0.1 Da for fragment masses with allowance for one max 
missed cleavage by trypsin. Variable modifications were obtained by Gln->pyro-Glu (N-term Q), 
Oxidation (M), and Deamidated (NQ). Fixed modifications were obtained by Carbamidomethyl 
(C), iTRAQ 8 plex (N-term) and iTRAQ 8 plex (K). To reduce the probability of false peptide 
characterization, we considered and counted peptides with scores >20 at 99% or greater 
confidence by Mascot probability analysis, and protein identification required at least one unique 
peptide. 

For protein quantification, the peptide for quantification was automatically selected by the 
algorithm to calculate the reporter peak area (using default parameters in Mascot), which reflected 
the relative abundance of the proteins in the samples. It was required that a protein contains at 
least two peptides. The quantitative protein abundance ratios were derived from the median 
peptide ratio, and outliers were removed automatically. The quantitative protein ratios were 
evaluated and normalized by the median ratio in Mascot. The quantitation analysis was performed 
at the peptide level by following the procedures described on the website 
(http://www.matrixscience.com/help/quant_statistics help.html). Proteins with fold change >1.5 
and statistical P-value <0.05 were determined as differentially regulated proteins. The student’s 
t-test was performed using the Mascot software. 

Pearson’s correlations (r) and corresponding P-values were calculated in R statistical software 
(v3.1.1). Transcription factors and protein kinases were identified using HMMER (v3.1) (Eddy, 
2011) and the classification rules implemented in iTAK (v1.6) (Zheng et al., 2016). Statistically 
enriched KEGG (Kyoto encyclopedia of genes and genomes) metabolic pathways of the regulated 
proteins were identified using the “phyper” function, and the obtained P-values were adjusted 
using the FDR method in R statistical software. Gene ontology enrichment analysis of the 
responsive proteins was conducted in Cytoscape (v2.8.3) (Smoot et al., 2011) using the plugin 
BiNGO (v2.44) (Maere et al., 2005). Significantly enriched pathways and Gene Ontology (GO) 
terms were identified with FDR corrected P-values less than 0.05. The web-based Mercator 
annotation tool (Lohse et al., 2014) was used to assign the identified proteins with a MapMan bin 
code. Protein diversity and abundance were analyzed using MapMan software (v3.5.1R2) (Thimm 
et al., 2004; Usadel et al., 2005). 


3 Results and discussion 


3.1 Rehydration proteome of B. argenteum 


In this study, B. argenteum gametophores that were desiccated (Dry), rehydrated for 2 h (R2) and 
rehydrated for 24 h (R24) were collected for proteome quantification, and the scheme for 
obtaining the rehydration proteome is presented in Figure la. A total of 793,811 spectra were 
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obtained from the iTRAQ-LC-MS/MS proteomic analysis. After data filtering to eliminate 
low-scoring spectra, 87,175 unique spectra passing a strict confidence threshold for identification 
were matched to 18,874 peptides. These peptides corresponded to 5503 non-redundant protein 
sequences, representing approximately 12% of the 45,918 predicted proteins from the B. 
argenteum rehydration transcriptome (Gao et al., 2015). 
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Fig. 1 Paralleled Bryum argenteum transcriptome and proteome profiling after rehydration. (a) Scheme 
delineating the experimental procedures applied to measure transcript and protein relative abundance changes at 
different hydration time points including Dry, R2, and R24. (b) Hierarchical clustering analysis with columns 
representing experimental time points and rows representing the proteins that were significantly regulated after 
rehydration and corresponding transcripts. (c) Venn diagram demonstrating the number of transcripts and proteins 
detected using DGE-Seq and iTRAQ based on the B. argenteum reference transcriptome. (d and e) Correlation 
between mRNA and protein changes 2 and 24 h post rehydration indicated a weak correlation between 
transcription and translation. The Pearson’s correlations and P-values are indicated. 


The proteome quantification data were preliminarily compared to the previously reported 
digital gene expression (DGE) results. And we didn’t find a significant correlation between them 
based on the expression profile clustering analysis (Fig. 1b). The mRNA amounts were drastically 
changed upon rehydration (log2FC min-max: —16.7 to 14.3), whereas the proteome undergoes 
limited changes (log2FC min-max: —4.7 to 1.95). The DGE-Seq reads aligned to a total of 47,481 
transcripts, but the iTRAQ proteome peptides only identified 5503 proteins, among which 355 
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proteins were solely identified by iTRAQ (Fig. 1c). Overall, the protein abundance change 
following rehydration was poorly correlated with that of the corresponding changes in mRNA 
expression. However, the protein expression changes correlated better at 24 h after rehydration 
than at 2 h after rehydration (Figs. 1d and e). It should be noted that poor correlation between the 
expression of mRNA and protein has been reported before (Gygi et al., 1999; Soufi et al., 2009; 
Lan et al., 2012; Peng et al., 2015). Furthermore, some metabolic traits are better correlated with 
mRNA amounts than with proteins (Agarwal et al., 2012). The insignificant correlation between 
mRNA and protein expression following rehydration could be partially explained by the fact that 
the desiccated and dormant B. argenteum as the initial hydrated plant material limited the 
“rehydration response” because some protection and rehydration related substances and/or 
transcripts were probably already in place i.e., hardened by field effects (Proctor et al., 2007; 
Oliver and Bewley, 2010; Stark, 2017). 

Querying the Blast2GO (Gene Ontology) annotation results of the B. argenteum reference 
transcriptome (Gao et al., 2015) resulted in 4200 (76.3%) protein sequences returning descriptive 
GO term information. Second-level GO term classification of the identified proteins demonstrated 
similar sequence frequencies across all three GO categories, i.e., biological process, molecular 
function and cellular component compared with the reference transcriptome (Fig. 2). In the 
biological process category, the most populous groups were cellular process, metabolic process, 
response to stimulus and biological regulation. The dominant groups were binding, catalytic and 
transporter in the molecular function category. In the cellular component category, the largest 
groups were cell, cell part, and organelle. 
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Fig. 2 Gene ontology (GO) classification for the B. argenteum proteome and comparison with the reference 
transcriptome. GO annotation results of the B. argenteum proteome and transcriptome were mapped to categories 
within the second-level of GO terms. GO terms that contain more than five sequences were included in this graph. 


Among all identified proteins, 3045 (55.3%) proteins could be assigned to COG (clusters of 
orthologous groups) functional categories. Since some proteins could be assigned with multiple 
COG functional categories, a total of 4136 functional annotations were produced (Table 1). 
Earlier genomic analysis of the model moss P. patens has identified a large number of orphan 
genes (at least 13% of the genome) and functionally unknown gene annotations (>40%) (Rensing 
et al., 2008; Zimmer et al., 2013). Similarly, analysis of the B. argenteum proteome also identified 
a high percentage of functionally unknown proteins (i.e., 45%). 

To scrutinize the protein diversity associated with different biological pathways, we queried all 
identified proteins against the KEGG pathway database and 3390 proteins were anchored to 124 


JOURNAL OF ARID LAND 


Table 1 Clusters of orthologous groups functional categories of all identified proteins in the rehydration 
proteome 


Code Functional category Proteins 
R General function prediction only 705 
(0) Post translational modification, protein turnover, and chaperones 451 
J Translation, ribosomal structure and biogenesis 383 
C Energy production and conversion 341 
G Carbohydrate transport and metabolism 326 
E Amino acid transport and metabolism 272 
I Lipid transport and metabolism 190 
K Transcription 153 
T Signal transduction mechanisms 150 
Q Secondary metabolites biosynthesis, transport and catabolism 149 
L Replication, recombination and repair 142 
H Coenzyme transport and metabolism 137 
P Inorganic ion transport and metabolism 136 
M Cell wall/membrane/envelope biogenesis 135 
U Intracellular trafficking, secretion and vesicular transport 120 
S Function unknown 99 
F Nucleotide transport and metabolism 75 
D Cell cycle control, cell division, chromosome partitioning 46 
Vv Defense mechanisms 38 
Z Cytoskeleton 36 
B Chromatin structure and dynamics 31 
A RNA processing and modification 16 
Y Nuclear structure 4 
N Cell motility 1 


KEGG pathways. The number of identified proteins, as well as the proteins with a countable 
quantification ratio (i.e. quantified proteins), involved in each of the second-level KEGG 
categories were depicted in Figure 3a. Among the 19 second-level KEGG pathway categories, 
carbohydrate metabolism, and translation contained the largest number of proteins, indicating 
their potentially important roles in the rehydration process. Furthermore, box-and-whisker plot 
analysis of protein expression change was performed to illustrate the overall protein level changes 
for each of the KEGG categories (Fig. 3b). From Figure 3b, a distinct decline in protein content 
could be observed for environmental adaption and transcription. Moreover, a dominant change of 
protein kinases and phosphates, such as the MAPK, CDPK, calmodulin, and EF hand 
calcium-binding proteins, was observed, suggesting these regulatory proteins played crucial roles 
in environmental adaptions (e.g., circadian rhythm and plant-pathogen interaction). Observed 
suppression of the transcription category could be associated with the release of masked mRNA 
pool upon rehydration (Oliver and Bewley, 1984; Oliver, 1991; Wood and Oliver, 1999). TFIIF1 
(unigene33350) and TFIIH4 (cl2790.cg2) are transcription initiation factors embedded in the 
basal transcription factors (eukaryotes) KEGG pathway, and both of them were significantly 
down-regulated following rehydration. To further illustrate the ratios of protein change involved 
in each metabolic pathway, we created the MapMan overview illustrations for both of R2 (Fig. 4a) 
and R24 (Fig. 4b) samples compared with the desiccated sample (Dry). 

Basing on the Pfam protein family database, we assigned the identified proteins with specific 
protein families/domains information and categorized the 4284 protein sequences (77.8%) into 
2117 Pfam families/domains. The most populous protein families/domains were RNA recognition 


GAO Bei et al.: Desiccation tolerance in bryophytes: the rehydration proteomes of Bryum argenteum... 


motif (RRM)-containing proteins, protein kinases and WD40 repeat-containing proteins (Table 2). 
Their high frequencies of occurrence indicated that these protein families may play pivotal roles 
in the B. argenteum resuscitation process. 
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Fig. 3 Proteome coverage and protein abundance changes of each second-level KEGG categories. (a) The upper 
panel showed the number of identified (dark) and quantified proteins (light grey) for each KEGG category. (b) 
Box-and-whisker plot of protein expression changes between rehydrated (R2 and R24) and desiccated (Dry) 
samples in different metabolic categories. 


3.2 Identification of differentially regulated proteins 


Differentially regulated proteins were identified by the iTRAQ quantitative proteomic analysis. 
Totally, 294 and 400 proteins were found to be significantly down-regulated after rehydrating 2 
and 24 h, respectively. The decline in protein content during rehydration is consistent with the 
proteomic analyses in other bryophytes (Wang et al., 2009; De Carvalho et al., 2014), and Oliver 


(1991) also 


found that the rates of protein synthesis are significantly altered during the first 2 h of 


rehydration. Meanwhile, 131 and 181 proteins were found to be significantly induced after 


rehydration 


2 h and 24 h, respectively. Moreover, up-regulated proteins were associated with 


some functional categories, which have been proved that protein-mediated repair mechanism was 
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Fig. 4 MapMan overview of B. argentum cellular metabolism changes after 2 h (a) and 24 h (b) of rehydration. 
Individual identified proteins are represented by colored squares. The color code scale is based on the log2 
transformed ratio of each protein. The squares colored with dark grey represent the identified proteins without 
quantification ratios. Red squares were associated with induced proteins and greens for repressed ones. 
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Table 2 Pfam families in the Bryum argenteum rehydration proteome 


Accession Pfam symbol Description Proteins Up/down 
PFO0076.17 RRM_1 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain) 53 0/17 
PF00400.27 WD40 WD domain, G-beta repeat 51 0/3 
PF00069.20 Pkinase Protein kinase domain 47 1/4 
PF00004.24 AAA ATPase family associated with various cellular activities 38 3/1 
PF12697.2 Abhydrolase_6 Alpha/beta hydrolase family 37 1/0 
PF00067.17 p450 Cytochrome P450 36 3/0 
PF00106.20 adh_short Short chain dehydrogenase 31 0/0 
PF00005.22 ABC_tran ABC transporter 30 1/2 
PF08240.7 ADH_N Alcohol dehydrogenase GroES-like domain 28 0/0 
PF00085.15 Thioredoxin Thioredoxin 27 5/1 
PF00012.15 HSP70 Hsp70 protein 24 1/0 
PF00501.23 AMP-binding AMP-binding enzyme 24 0/0 
PF00107.21 ADH_zinc_N Zinc-binding dehydrogenase 23 1/0 
PFO0504.16 Chloroa_b-bind Chlorophyll A-B binding protein 23 1/4 
PF13414.1 TPR_11 TPR repeat 22 0/2 
PF13460.1 NAD_binding 10 NADH(P)-binding 22 1/0 
PFO0153.22 Mito_carr Mitochondrial carrier protein 21 0/1 
PF00270.24 DEAD DEAD/DEAH box helicase 21 0/1 
PF00271.26 Helicase _C Helicase conserved C-terminal domain 21 0/1 
PF03144.20 GTP_EFTU_D2 Elongation factor Tu domain 2 21 0/0 
PF00227.21 Proteasome Proteasome subunit 20 2/0 
PF00160.16 Pro_isomerase Cyclophilin type peptidyl-prolyl cis-trans isomerase/CLD 19 0/1 
PF00168.25 C2 C2 domain 19 1/0 
PF13499.1 EF-hand_7 EF-hand domain pair 19 2/2 
PFO0009.22 GTP_EFTU Elongation factor Tu GTP binding domain 18 0/1 
PF00118.19 Cpn60_TCP1 TCP-1/cpn60 chaperonin family 18 2/0 
PF00171.17 Aldedh Aldehyde dehydrogenase family 18 2/0 
PF00248.16 Aldo_ket_red Aldo/keto reductase family 18 5/0 
PF07714.12 Pkinase_Tyr Protein tyrosine kinase 18 3/1 


an important aspect of rehydration (Oliver and Bewley, 1984, 2010; Oliver, 1991; Oliver et al., 
2004). On the other hand, a total of 4804 (87.3%) proteins were insignificantly regulated by the 
rehydration (Fig. 5a). At the same time, the rehydrated samples (i.e., R2 vs R24) were compared 
by using the protein ratios. We found that there is a significant correlation (r=0.91) between R2 
and R24 in the protein changes (Fig. 5b), which suggested the limited protein changes upon 
extended rehydration, as well as the reliability of the proteome quantification data. 


3.3 Function enrichment analysis of the rehydration responsive proteins 


To survey relationships of enriched metabolic pathways and functional categories with 
rehydration, we performed KEGG pathway and hierarchical GO category enrichment analyses 
according to the P-values calculated on hypergeometric distribution. A series of KEGG pathways 
affected by rehydration with values of P<0.05 were identified. The results demonstrated that five 
KEGG pathways were differentially regulated after 2 h rehydration including spliceosome 
(KO03040), ribosome (KO03010), RNA transport (KO03013), sulfur relay system (KO04122) 
and photosynthesis (KO00195). Three KEGG pathways, i.e., photosynthesis, spliceosome, and 
RNA transport were enriched in differentially regulated proteins after 24 h rehydration. 
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Fig. 5 Detection of the rehydration regulated proteins. (a) Number of differentially regulated proteins upon 
rehydration; (b) correlation analysis between R2 and R24 rehydrated samples. 
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The significant induction of stress-related genes after 2 h of rehydration at the transcript level 
has been previously demonstrated (Gao et al., 2015). The enrichment of ribosome pathway 
(KO03010) at R2 treatment indicated that altered protein biosynthesis was a critical cellular 
process during the early rehydration phase. The biosynthesis of molybdenum cofactor (Moco) is 
part of the sulfur system and essential to the function of several enzymes such as sulfite oxidase, 
aldehyde oxidase and xanthine dehydrogenase. These enzymes help metabolize a variety of toxic 
substances and have been previously characterized to be inducible by drought-stress in 
Arabidopsis thaliana (Bittner et al., 2001; Wollers et al., 2008). Close observation showed that a 
MOCS2 protein (molybdopterin biosynthesis MoaE family protein (unigenel1401)) was 
up-regulated upon rehydration. The photosynthesis pathway (KO00195) was differentially 
regulated by rehydration at both 2 and 24 h. Though a part of the proteins involved in 
photosynthesis were down-regulated, four F-type ATPase proteins were all up-regulated after 24 h 
of rehydration, suggesting the importance of ATPase function in the resuscitation after desiccation. 
The recovery of photosynthesis upon rehydration was delayed and the proteomic data were 
consistent with those results, i.e., positive carbon balance was restored within 30 min and 
photosynthesis was fully recovered after 24 h (Bewley, 1979; Oliver and Bewley, 1984, 2010; 
Oliver et al., 2000a; Li et al., 2014). Our results demonstrated that protein changes in early 
rehydration played a critical role in the restoration of cellular damages introduced by 
dehydration/desiccation. 

GO term enrichment analysis of the differentially regulated proteins was conducted using 
BiNGO, a Cytoscape plugin assessing over-representative terms with the whole B. argenteum 
transcriptome being the background. All rehydration up- and down-regulated proteins (Fig. 5a) 
were subjected for functional enrichment analysis to get their functional categories relating with 
rehydration responsive proteins (Fig. 6). Interestingly, two biologically significant processes (i.e., 
photosynthesis and translation) were identified in both of the up- and down-regulated GO 
networks. Though the number of down-regulated proteins was larger than that of up-regulated 
ones, the number of statistically enriched up-regulated GO terms was more than that of 
down-regulated (Fig. 6), suggesting higher functional aggregative degree of induced proteins. The 
repression of proteins was dispersed among broader functional categories and pervasive protein 
loss could be regarded as a universal event during rehydration after desiccation. 

Among rehydration up-regulated proteins, a number of rehydration-related biological processes 
were significantly enriched including carbohydrate metabolic process, response to abiotic 
stimulus, generation of precursor metabolites and energy, catabolic process and biosynthetic 
process. Each of these processes was hypothesized to play an important role in the recovery 
process upon rehydration. And all of the results illustrated the importance of translational control 
and photosynthesis recovery, accompanied by the cellular component of ribosome and 
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plastids/thylakoid. Furthermore, elevated metabolic (both biosynthetic and catabolic) activities 
could also be observed in the GO enrichment network. 
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Fig. 6 GO-slim term enrichment network analysis for the up-regulated (a) and down-regulated (b) proteins upon 
rehydration. Node size represented the number of proteins in node and node filled color represented P-value. 
White nodes denote non-significantly enriched GO terms. 


3.4 Major families of RNA-binding proteins declined upon rehydration 


Earlier study of Tortula ruralis (Oliver et al., 2000b), proposed the strategy to overcome 
desiccation revolves around the polysomal retention of transcripts during dehydration (Oliver and 
Bewley, 1984; Oliver, 1991; Wood and Oliver, 1999) and these retained transcripts are translated 
upon rehydration through activation of a repair-based mechanism (Oliver, 1991; Oliver et al., 
1997). Previous study in the yeast Saccharomyces cerevisiae illustrated that 70% of the mRNA 
transcriptome had significant associations with at least one of the examined RNA-binding 
proteins (RBP), which strongly suggested binding of RBPs to mRNAs is a pervasive mechanism 
for regulation of their post-transcriptional fate (Hogan et al., 2008). 

In this study, 17 RRM containing proteins were significantly down-regulated, suggesting a 
crucial role in the rehydration process. Other protein families with most populous members 
include WD40 proteins, various kinds of protein kinases, ATPase family and hydrolase family 
(Table 2). However, compared with the RRM containing proteins, only a small fraction of these 
protein family members were rehydration responsive, though high frequency of occurrence of 
these protein families were detected. We were also able to find 9 KH domain containing proteins 
and 3 were down-regulated upon rehydration. According to the COSMOSS (v1.6) and KEGG 
annotation, cl2734.cg1 was a KH domain-containing protein and classified as a C3H transcription 
factor. However, the other 2 KH domain-containing proteins (cl2892.cg1 and cl676.cg3) were 
annotated as, respectively, poly (rC)-binding protein and far upstream element-binding protein, 
probably indicating their different RNA-binding sites. 


3.5 Metabolic, ROS scavenging and detoxifying related enzymes 


Upon rehydration, two up-regulated aminoacyl-tRNA ligases were discovered to be 
rehydration-responsive. In the fructose and mannose metabolism pathway, two enzymes 
(aldehyde reductase (EC: 1.1.1.21) and triose-phosphate isomerase (EC: 5.3.1.1)) were induced. 
Four enzymes in the “starch and sucrose metabolism” pathway were up-regulated upon 
rehydration. In the fatty acid metabolism, fatty acid biosynthesis and fatty acid elongation 
pathways, 5 induced proteins were observed. The up-regulation of proteins associated with 
translation, carbohydrate metabolism and lipid anabolism are consistent with a comprehensive 
recovery of metabolism following rehydration. 

Eighteen aldehyde dehydrogenases were identified in the B. argenteum proteome (Table 2). 
Two succinic semialdehyde dehydrogenases (ALDH5 or SSADH; EC 1.2.1.16) were significantly 
up-regulated upon rehydration, which contribute to restrict levels of reactive oxygen 
intermediates in plants (Kirch et al., 2004; Brocker et al., 2013). ALDH5 mutations in plants have 
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been shown to cause enhanced accumulation of reactive oxygen intermediates and cell death in 
response to light and heat stress (Bouché et al., 2003). Eighteen putative glutathione S-transferase 
(GST, EC: 2.5.1.18) proteins were identified, among which 4 GSTs (cl4387.cg1, cl858.cg2, 
unigene6153 and unigene33419) were up-regulated. The remaining 14 GST proteins, 5 
glutathione reductase (GR, EC: 1.8.1.7) proteins and 4 glucose-6-phosphate dehydrogenases 
(G6PDH, EC: 1.1.1.49) proteins were not up-regulated during rehydration. In addition to 
scavenging the free radicals formed during desiccation, the redox couple of glutathione is also 
involved in protecting protein thiol-groups from desiccation-induced injury by forming 
protein-bound glutathione (PSSG) (Kranner, 2002; Kranner et al., 2003; Djilianov et al., 2011). In 
this way, SH-groups are protected from both irreversible formation of intramolecular disulphide 
bonds and oxidation to sulphonic acids (Kranner and Grill, 1996). 


3.6 Late Embryogenesis Abundant (LEA) and heat shock proteins 


The LEA proteins have been postulated to be associated with drying plant tissues and played very 
important roles in maintaining cellular integrity when water was reintroduced (Oliver et al., 2004). 
The LEA superfamily could be classified into seven groups according the Pfam database 
annotation (i.e., LEA_1 through LEA_5, dehydrin and SMP) (Hundertmark and Hincha, 2008). 
Using the proteome Pfam annotation information, 5 LEA_2, 1 LEA_4, 4 LEA_5, 3 Dehydrin, and 
1 SMP were discovered. Among the 14 identified LEA proteins, 4 LEA proteins (2 LEA_4 group, 
1 LEA_5 group, and 1 dehydrin protein) were found to be down-regulated. This observation is 
consistent with the reported P. patens proteome data (Cui et al., 2012). 

A total of 43 Heat Shock Proteins (HSPs) was identified. The HSP70 and DnaJ (HSP40) 
subfamilies were the most abundant and the identified HSP proteins include 2 HSP20, 15 HSP40, 
3 HSP60, 18 HSP70, 4 HSP90, and 1 CLPB. The HSP60 (unigene26811) and HSP40 
(unigene7095) proteins were up-regulated during rehydration. In particular, another HSP60 
(unigene26811) protein was up-regulated after 2 h rehydration and was more than 3.0-fold 
accumulation after 24 h rehydration. And another HSP40 (unigene7095) protein was 2.6-fold 
up-regulated, and then decreased to the desiccated level after 24 h rehydration. However, a 
HSP70 protein (unigene32310) was down-regulated following rehydration. 

Rehydration responsive LEA proteins were down-regulated whereas several small molecular 
mass HSPs were induced upon rehydration; similar LEA/HSP accumulations in Fontinalis 
antipyretica were also observed (De Carvalho et al., 2014). HSP and LEA proteins are postulated 
to play pivotal roles in desiccation-tolerance by protecting protein structure and function in P. 
patens (Wang et al., 2009; Cui et al., 2012) and T. ruralis (Bouché et al., 2003; Oliver et al., 2004; 
Oliver et al., 2009), and the accumulation of HSP/LEAs during rehydration is an important 
component of the recovery in B. argenteum. 


3.7 Proteasomes and proteases 


The main function of the proteasome is to degrade unneeded or damaged proteins by proteolysis, 
a chemical reaction that breaks peptide bonds (Etlinger and Goldberg, 1977). Five proteasome 
proteins were differentially regulated after rehydration (unigene33241, cl830.cg2, cl1909.cg1, 
unigene33315, and cl1954.cg1). Components of the 26S proteasome, regulatory subunit N12 
(Rpnl2, unigene33241) and T4 (Rpt4, cl830.cg2), were up-regulated 2.4-fold and 1.7-fold, 
respectively. Rpt4 is predicted to encode an AAA-type ATPase family protein, suggesting 
enhanced demand for cellular energy derived from ATP hydrolysis (Glickman and Adir, 2004). 
Components of the 20S complex, subunit alpha 4 (cl1909.cg1) and beta 4 (unigene33315) were 
also up-regulated. Interestingly, a protein inhibitor of 20S proteasome activity, inhibitor subunit 1 
(PI31, cl1954.cg1) was down-regulated following rehydration (Zaiss et al., 2002). 

Proteases are a diverse group of proteolytic enzymes (van der Hoorn, 2008), which is 
hypothesized to degrade damaged proteins during desiccation (Cui et al., 2012; De Carvalho et al., 
2014). In this rehydration proteome, 10 responsive proteases were identified. A lon protease 
(cl696.cg8) and an ubiquitin-specific protease (unigene28814) were down-regulated upon 
rehydration. The induced proteases (8 out of 10) were 5 CLP protease components (c139.cg6, 
unigene33017, unigene29679, unigene32606, and cl3031.cg1), an ubiquitin-specific protease 
(unigene5413), an aspartyl protease (unigene25407), and a presequence protease (unigene3 1663). 
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Additionally, a serine protease inhibitor family protein (Serpin, unigene12375) was significantly 
up-regulated. The up-regulation of proteasome components and proteases during rehydration 
suggested that the removal of damaged proteins and recycling of polypeptides was a prominent 
feature of rehydration. 


3.8 Transcription associated proteins 


To further understand the gene regulation networks, we analyzed the proteome for 94 
transcription associated proteins (TAPs), including transcription factors (TFs) and transcription 
regulators (TRs) dispersed among 38 TAP families (26 TF families and 12 TR families). And 
among the 94 TAPs, orphan transcriptional regulator was the most populous, containing 11 family 
members, 6 of which were putative component (i.e., histidine kinase) of the two-component 
signal transduction system. In the entire proteome, 19 DNA-directed RNA polymerase proteins 
were identified, 3 of which (unigene42474, unigene25526, and unigene34104) were significantly 
up-regulated. At the same time, the MYB TF family was the most populous in the 26 TF families, 
comprising 8 members, 3 of which (unigene384, c13238.cg1, cll1435.cg1) were down-regulated 
after rehydration. Overall, the down-regulated TAPs include 8 TFs and 3 TRs, including 3 MYB 
(unigene384, cl3238.cg1, c11435.cg1), 2 C3H (cl839.cg2, cl2734.cg1), 2 C2H2 (unigene5094, 
unigene10) and 1 ARR-B (unigene11580), and 3 TRs were SWI/SNF-BAF60b (cl2414.cg1), CSD 
(cl987.cg1) and pseudo ARR-B (cl366.cg11). Three TAPs were up-regulated after rehydration, 
including a TRAF transcriptional regulator (unigene12565), an orphan transcriptional regulator 
(unigene23867) and a LOB transcription factor (cl4978.cg1). This comprehensive proteome-wide 
identification and quantitative analyses of rehydration responsive TAPs provide potential 
candidates for further stress biology experiments. 


4 Conclusions 


In this study, we analyzed the rehydration proteomes of the DT bryophyte B. argenteum using 
iTRAQ technology. In order to eliminate the field effects, we cultured the material in the lab and 
regenerated the gametophores with a uniform genetic background. Using this experimental 
strategy, we discovered that 12.7% of the identified 5503 proteins were differently regulated. 
After 2 h of rehydration, 131 proteins were up-regulated and 294 proteins were down-regulated. 
And after 24 h rehydration, 181 proteins were up-regulated and 400 proteins were down-regulated. 
Protein amounts were significantly altered during the first 2h rehydration and up-regulated 
proteins after 2 and 24 h rehydration represented rehydration-associated repair/recovery 
mechanisms. These mechanisms include: (1) enhanced translation and protein synthesis, (2) 
active ROS scavenging, (3) accumulation of LEAs and HSPs, (4) recovery of photosynthesis, and 
(5) the release of mRNAs masked in the messenger ribonucleoprotein particles (nRNPs) formed 
upon desiccation. 
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